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ABSTRACT 

A modification of the usual graphical representation 
of heterogeneous regressions is described that can aid in 
interpreting significant regions for linear or quadratic surfajres* 
The standard Johnson -Neyman graph is a bivariate plot with the 
criterion variable on the ordinate and the predictor variable on the 
abrcissa. Regression surfaces are drawn for each group. If there are 
regions of significance, their boundaries are noted either on the 
graph or in the text. If there is a manageable number of cases ^ the 
cases may be plotted on the graph with different symbols for the 
groups. This standard style of representation often suffers from 
inclusion of too many cases for realistic plotting of data and from 
the difficulty in translating a bivariate display of data into 
univariate distributional characteristics of the two groups. The 
modification proposed here alleviates these problems. Computer 
programs for solutions in both linear and quadratic surfaces are 
included. These programs, which were written in GAUSS, differ from 
earlier programs in that the input required is summary information 
available from standard computer package output rather than raw data 
and is entered interactively. A 39-item list of references, three 
graphs, and seven pages of computer programming language are 
provided. (TJH) 
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Graphical Description of Johnson-Neyman Outcomes for Linear and Quadratic Regression Surfaces 

William D. Schafer and Yuh-Yin Wang 
University of Maryland at College Park 

The presence of heterogeneous regression slopes in an analysis of covariance d^ign is analogous to 
the presence of interaction in factorial designs. It is puzzling, therefore, that current practices seem to 
follow conflicting recommendations in analyzing these two plans. Chosen as an example only because 
of recency, Maxwell and Delaney (1990) present an analysis of a two-way, fixed-efiects design using, as 
denominators for all F tests, the mean square residual about a model that includes interaction (see 
Chapttr 7), whereas they present an analysis of a one-way analysis of covariance design using, as the 
denominator of the F test for treatment efiects, the mean square residual about a model with the 
common within-groups regression slope. Tests for heterogeneous slopes and eventual recommendations 
about incorporating the more complete model for all tests are treated later under the heading of an 
extension and using a different name for the technique (see Chapter 9). Some common computer 
package procedures (e,g., SPSS ANOVA) include interaction among explained sources in analyses of 
variance and heterogeneity among unexplained sources in analyses of covariance. These practices are 
contradictory. Identifying heterogeneous regressions in analysis of covariance designs makes as muclt 
sense as identifying interaction in two-way designs and on that basis should be equally frequent. 
Furthermore, techniques for display and significance testing exist in either context. 

Interaction in factorial designs is often illustrated with the aid of interaction graphs of cell means, 
where the abscissa is, if one exists, a blocking factor, and the ordinate is the criterion. Contra.sts, with 
or without familywise error rate control, can be used to compare treattnent differences at levels of the 
blocking factor. Similarly, in analysis of covariance designs, heterogeneous slopes may be illustrated by 
a plot of the two or more group-specific regression equations on a graph where the abscissa is the 
covariate and the ordinate is the criterion. The plot may or may not display the data points. Since 
the scores are not grouped on the covariate, the parallel question about conditional treatment 
differences is usually addressed by partitioning the range of the covariate into regions where tlie 
differences are significant and where they are not. As in the case of factorial designs, this can be done 
with or without familywise error rate control. Known in general as the Johnson-Neyman procedure, a 
rnethiKl to accomplish this analysis was originally developed by Johnson and Neyman (1936) for the 
two-group case; Pedhazur (1982) gives details of the procedure and references to extensions for more 
than two groups, more than one covariate, and regions where the significance interpretation is 
simultaneous. 

Several authors have investigated the Johnson-Neyman technique and extended it beyond tlie one- 
or two-predictor cases. Potthoff (1964) described Scheffe-type solutions for simultaneous confidence 
intervals, multiple predictors (due to Abelson, 1953), multiple groups, and multiple criteria. Calien 
and Linn (1971) have compared three methods for defining regions of significance that differ in their 
coHservativeness. Solutions for multiple groups and multiple predictors have also been described by 
Forsler (1971, 1975). Aitkin (1972) has developed a ;5imilar technique. Shields (1978) has found that 
liie Johnson- Neyman technique is sensitive to heterogeneity of residual variances for equal as well as 
unequal group sixes, although Borich and Wundcrlich (1973) did not. Hollingsworth (1977) Iuls 
compared tests for honiogeneous variances in contexts including the Johnson-Neyman technique and 
I^igache and Graham (1976) have described an extension that does not assume that variances are 
homogeneous. Rogosa (1980, 1981) has discussed issues surrounding the Johnson-Neyman tecl'.nique, 
inchiding sinmltaneous significance adjustments, and Wilcox (1987) has described an adaptation of 
Tnkey-Kramer simultaneous procedures to multiple-group cases. Tsutakawa (1978) has developed a 
Hayesian solution and Houston (1987) k\s reported in Houston and Novick (1987) has demonstrated the 
ec|uivalence of Bayosian solutions using noninformative priors with classical Folutions^ assuming equal 
residual variances. Borich and Wtmderlich (1973) have discussed procedures for plotting results with 
two groups and two predictors and Wunderlich and Borich (1974) have described an extension to 
quadratic regressions. 



Several computer programs have been prepared to yield Johnson-Neyman solutions. Carroll and 
Wilson (1970), Ceurvorst (1979), Karpman (1980), Kush (1986), Lautenschlager (1987), Sclalfa (1987), 
and Strube (1988) have developed algorithm^i for this purpose in various programming languages and 
Karpman (1983, 1986) has discussed how to arrive at Johnson-Neyman solutions using SAS and SPSS 
transformational languages. A program that yields solutions for heterogeneous quadratic regrt^ions 
can be found in Wunderlich and Borich (1974), 

The ERIC and PsychlNFO document bases were searched for Johnson()Neyman in order to 
survey uses of the Johnson-Neyman technique. Berliner (1971) found interactions among memory 
aptitudes, note-taking, and test-like events. Keim-Abbott and Abbott (1977) found an interaction 
between instructional treatments ajid mental ability. McLeskey and Rieth (1982) compared reading 
disabled and normal children with IQ as the predictor. Dunbar and Novick (1985) found differential 
predictions by gender in Marine Corps training programs. Gamache and Novick (1985) have examined 
differential prediction of grade-point average by gender. Houston and Novick (1987) examined 
differential prediction by race. Reeves- Kazelsk is and Kazelskis (1987) found a disordinal interaction 
between treatments and a prior knowledge pretest. 

The purpose of this paper is to describe a modification to the usual graphical representation of 
heterogeneous regressions that can aid in interpreting significant regions for linear or quadratic 
regression surfaces. Computer programs for solutions in both cases are included. These differ from 
earlier programs in that the input required is summary information available from standard computer 
package output instead of raw data, and is entered interactively. In most applications, we feel a 
researcher will have run preliminary tests (for heterogeneous linear or quadratic regressions) before 
deciding to perform a Johnson-Neyman analysis; Wunderlich and Borich (1974) present a decision plan 
for making this judgment. The data are thus already prepared for analysis, likely in the format needed 
by a computer package, and the needed summary information is or easily can be made available, so a 
simple interactive program to find the Johnson-Neyman solution(s) will be more convenient to use than 
it would be to set up and run the analysis **from scratch'' nsing a raw-data algorithm. 

GRAPHICAL REPRESENTATION 

The standard Johnson-Neyman graph is a bivariate plot vvith the ciiterion variable on the 
ordinate and the predictor variable on the abscissa. Regression surfaces are drawn for each group. If 
there are regions of significance, their boundaries are noted either on the graph or in the text. If there 
arc a manageable number of cases, they may be plotted on the graph with different symbols for the 
groups. A prototypic example can be found in Walker and Lev (1953, p. 404). 

There are at least two difficulties with this style of presentation. First, the number of cases may 
be too large to be able realistically to plot the data. The graph would become too confusing and 
multiple overlapping points are difTicult to represent graphically. The second difficulty is that a 
bivariate display of the data does not readily translate into univariate distributional characteristics of 
*^e two groups. The latter is important, particularly for the predictor variable^ since the locations of 
significant differences are defined on the predictor and best interpreted in relation to its distribution. 

A convenient way to convey information about the univariate distribution of a sample is to use a 
box-arid-whisker plot. Although there is no general agreement about what constitutes a box-ard- 
whisker plot. Glass and Hopkins (1984, pp. 23-24) have recommended an approach that is easy to 
explain and implement. It is reconmicndcd here that a box-and-whisker plot for each group be drawn 
on the scale of the predictor variable and placed beneath the abscissa of the graph. Also, boundaries 
between significant and nonsignificant rtgions within the range of the data can be identified by vertical 
lines. In this manner, it is easy to grasp the relationships among the distributions of the groups and 
where on these distributions significant differences are located. 
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An example for the linear case taken from Guthrie, Schafer and Hutchinson (1991) is included in 
this paper. The graph describes differential predictions by gender for document reading variety when 
document achievement is the predictor (both scales were constructed using the data from the National 
Assessment of Educational Progress 1986 study of young adult literacy). The data are for 870 blacks 
and 1945 whites, clearly too many subjects to plot in any meaningful way. On the artificial document 
achievement scale (X; an ability measure), the percentile for blacks that formed the box-and-whisker 
plot were 5(.1.45), 10(^1.13), 25(-.42), 50(-.02), 75(.38), 90(.98), and 95(1.30). The corresponding 
percentiles for whites were 5(-.43), 10(".28), 25(.16), 50(.84), 75(1.46), 90(2.18), and 95(2.43). The 
regression lines were Y' = 8.62 -f 2.17X for blacks and Y' = 9.26 + L75 for whites. The region of 
nonsignificance (confidence coefficient = .95) was between .53 and 35.12. The vertical line in the 
diagram corresponds to .53 on the document achievement scale; the other boundary was outside the 
range of the data and thus was not drawn. The figure seems to present this information in an 
appeaiiag and interpretable way. 

Two examples for the curvilinear case taken from Guthrie, Schafer and Wang (1991) are also 
included. In both examples, the criterion is reading achievement and the groups are black males and 
black females. The predictors are study strategies, showing nonintersecting quadratic regressions with 
one solution in the range of the data, and general reading activities, showing intersecting regressions 
witli four solutions in the range of the data (both predictors are derived variables). The data were part 
of the 1986 NAEP reading assessment. Because both predictors were derived scales made up of 
variables in different partitions of the items in the balanced incomplete block design by which the 
item? were administered to the participants, the percentiles used in the box and whisker plots were 
c;timated assuming normality in each group instead of directly from observed distributions and are 
therefore only crude approximations to the actual percentiles that might otherwise have been obtained. 

COMPUTER PROGRAMS 

Computer programs for Johnson-Neynian solutions for two groups were written, one for linear 
solutions and one for quadratic solutions. These programs were designed to be interactive and to take 
as input summary information available from the output of regression procedures found in standard 
statistical packages; our assumption is that the user will have already determined that heterogeneity of 
regression exists and will have found the linear or quadratic regression equations, the univariate means 
and standard deviations and the bivariate correlations among the variables for each group along with 
the group sizes. 

The programs were written in GAUSS. This language was chosen because it is fast and maintains 
a high degree of precision. It also has available two procedures necessary for the solution: one that 
returns a p-value for an F distribution and one that returns the solutions to general equations of the 
nth degree. 

The source code and a sample output are given for the linear solution. The data used were taken 
from an example in Walker and Lev (1953, p. 403). The confidence coefficient and the adjustments 
necessary for a simultaneous region (Potthoff, 1964) are easily changed where the fcv procedure is 
called. The solution agrees with that in Walker and Lev (1953). 

The source code and a sample output are also given for the quadratic solution. This solution was 
develo}>ed using the two-predictor solution presented by Walker and Lev (1953, pp. 406-407). Their 
predictors were labeled X and Z. The solution was algebraically obtained by substituting for Z and 
simplifying. Since a quartic equations results, up to four solutions are possible, making interpretation 
less straightforward than for tlie line.ir case. Therefore, th program requests the domain of the 
predictor variable and outputs significance decisions for eleven equally spaced points in that interval. 
It is suggested in using the program that the observed range of the predictor is used as the end-points 
of its domain, effectively limiting interpretation to the interval where data are present. In order to 
check the program, a solution for an example was developed by hand and compared with the results. 
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new; 

• proc fcv(ndf ,ddf, alpha) ; 

local f ,fl,fh,ndf ,ddf ,alpha,a; 

a=0; 

fl=0; 

fh»200; 

f«i; 

do until abs(a-alpha)<. 000001; 
a = cdffc(f ,ndf ,ddf) ; 
if a < alpha; 
fh « f; 

f = (fl + f) / 2; 
else; 
fl « f; 

f = (fh + f) / 2; 
endif ; 

endo ; 
retp(f) ; 
endp; 

print "What is the size of group i?"; 
nl « con(l, 1) ; 

Iprint "The size of group 1 is " nl; 

print "What is the mean of group 1 on the predictor?"; 
roxl = con (1,1) ; 

Iprint "The mean of group 1 on the predictor is " mxl; 

print "What is the standard deviation of group 1 on the predictor?"; 

sdxl = con(l, 1) ; 

Iprint "The standard deviation of group 1 on the predictor is " sdxl; 
vxl = sdxl 2; 

print "What is the standard deviation of group 1 on the criterion?"; 
sdyl = con(l, 1) ; 

Iprint "The standard deviation of group 1 on the criterion is " sdyl; 
vyl = sdyl 2; 

print "What is the intercept for group 1?"; 
al = con(l, 1) ; 

Iprint "The intercept for group 1 is " al; 
print "What is the slope for group 1?"; 
bl = con(l, 1) ; 

Iprint "The slope for group 1 is " bl; 
print "What is the size of group 2?"; 
n2 = con(l, 1) ; 

Iprint "The size of group 2 is " n2; 

print "What is the mean of group 2 on the predictor?"; 
mx2 = con(l, 1) ; 

Iprint "The mean of group 2 on the predictor is " mx2; 

print "What is the standard deviation of group 2 on the predictor?"; 

sdx2 = con(l, 1) ; 

Iprint "The standard deviation of group 2 on the predictor is " sdx2; 
vx2 = sdx2 2; 

print "What is the standard deviation of group 2 on the criterion?"; 
sdy2 - con(l, 1) ; 

Iprint "The standard deviation of group 2 on the criterion is " sdy2 ; 
vy2 = sdy2 "2; 

print "What is the intercept for group 2?"; 
a2 = con(l, 1) ; 

Iprint "The intercept for group 2 is " a2; 
print "What is the slope for group 2?"; 
b2 = con(l, 1) ; 

Iprint "The slope for group 2 is " b2; 
rl = bl * sdxl/sdyl; 

« 1 2 
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Iprint "The correlation for group 1 is rl; 
r2 = b2 * sdx2/sdy2; 

Iprint "The correlation for group 2 is " r2; 

df = nl + n2 - 4; 

f = fcv(l,df , ,05) ; 

cyyl = (nl-1) * vyl; 

cyy2 = (n2-l) * vy2; 

cxxl = (nl-1) * vxl; 

cxx2 « (n2-l) * vx2; 

cxyl s= rl * sqrt(cxxl * cyyl) ; 

cxy2 = r2 * sqrt(cxx2 * cyy2) ; 

t = (cyyl-cxyl*2/cxxl+cyy2-cxy2^2/cxx2) ; 

a = -f/df * t * (l/cxxl+l/cxx;i)-i-(bx-b2) '2; 

a = a/2; 

b = f/df * t * (mxl/cxxl + inx2/cxx2) + (al-a2) * (bl-b2) ; 

c = -f/df * t * ( (nl+n2) /nl/n2 mxl^2/cxxl + mx2"2/cxx2) + (al-a2)"2; 

c = c/2; 

X « polyroot (a|b|c); 

print "Below are the two solutions at which the difference between the"; 
Iprint "Below are the two solutions at which the difference between the"; 
print "means of the two groups is exactly significantly different from zero."; 
Iprint "means of the two groups is exactly significantly different from zero."; 
print "In the first column are values of X, the predictor,"; 
Iprint "In the first column are values of X, the predictor,"; 
print "but only those solutions having zero in the second column are real."; 
Iprint "but only those solutions having zero in the second column are real."; 
print x; 
Iprint x; 
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The "Size of group l is 8.0000000 
* The mean of group i on tba predictor is 0.031250000 
The standard deviation of group 1 on the predictor is 0.24486120 
The standard deviatirn of group 1 on the criterion is 0. 5610962 0 

The intercept for group l is 0.96750000 
The slope for group i is 1.4400000 
The size of group 2 is 10.000000 

The mean of group 2 on the predictor is -0.19000000 

The standard deviation of group 2 on the predictor is 0.27448860 

The standard deviation of group 2 on the criterion is 0.14854970 

The intercept for group 2 is 0.20630000 

The slope for group 2 is -0.088100000 

The correlation for group 1 is 0.62841297 

The correlation for group 2 is -0.16279027 

Below are the two solutions at which the difference between the 

means of the two groups is exactly significantly different from zero. 

In the first column are values of X, the predictor, 

but only those solutions having zero in the second column are real. 

-5.0798438 0.00000000 
-0.21572336 0.00000000 
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• proc fcv(ndf,ddf, alpha) ; 
• local f ,fl,fh,ndf ,ddf ,alpha,a; 
a=0; 
fl=0; 
fh=200; 
f=l? 

do until abs(a-alpha) <. 000001; 
a « cdffc(f ,ndf ,ddf) ; 
if a < alpha; 
fh = f ; 

f = (fl + f) / 2; 
else; 
fl = f; 

f = (fh + f) / 2; 
endif ; 

endo ; 
retp(f); 
endp ; 

print "What is the size of group 1?"; 
nl « con(l, 1) ; 

Iprint "The size of group 1 is " nl; 

print "What is the mean of group 1 on the predictor?"; 
mxl = cor-(l, 1) ; 

Iprint "The mean of group 1 on the predictor is " mxl; 

print "What is the mean of group 1 on the squared predictor?"; 

mzl = con(l, 1) ; 

Iprint "The mean of group 1 on the squared predictor is; " mzl; 
print "What is the standard deviation of group 1 on the predictor?"; 
sdxl = con(l, 1) ; 
vxl = sdxl '^2; 

Iprint "The standard deviation of group 1 on the predictor is: " sdxl; 
print "What is the st. dev. of group 1 on the squared predictor?"; 
sdzl « con(l, 1) ; 
vzl = sdzl 2; 

Iprint "The st. dev. of group 1 on the squared predictor is: " sdzl; 
print "What is the st. dev. of group 1 on the criterion?"; 
sdyl - con(l, 1) ; 
vyl = sdyl * 2; 

Iprint "The st. dev. of group 1 on the criterion is: " sdyl; 
print "What is the X-Y correlation for group 1?"; 
rxyl = con(l, 1) ; 

Iprint "The X-Y correlation for group l is: " rxyl; 
print "What is the Xsquared-Y correlation for group 1?"; 
ryzl = con (1,1) ; 

Iprint "The Xsquared-Y correlation for group l is: " ryzl; 
print "What is the X-Xsquared correlation for group l?"; 
rxzl = Con (1,1) ; 

Iprint "The X-Xsquared correlation for group 1 is: " rxzl; 
print "What is the intercept for group 1?"; 
al = con (1 , 1) ; 

Iprint "The intercept for group 1 is: " al; 
print "What is the slope on X for group 1?"; 
bxl = con(l, 1) ; 

Iprint "The slope on X for group 1 is: " bxl; 
print "What is the slope on Xsquared for group 1?"; 
, = con (1,1) ; 

Iprint "The slope on Xsquared for group 1 is: " bzl; 
prir.t "What is the size of group 2?"; 
n2 = con(l, 1) ; 

Iprint "The size of group 2 is " n2; 
o 



print "What is the mean of group 2 on the predictor?"; 
inx2 = con(l, 1) ; 

Iprint "The mean of group 2 on the predictor is: " mx2; 

print "What is the mean of group 2 on the squared predictor?"; 

mz2 = con (1,1) ; 

Iprint "The mean of group 2 on the squared predictor is: " mz2; 
print "What is the standard deviation of group 2 on the predictor?"; 
sdx2 » con (1,1) ; 
vx2 « sdx2 '* 2; 

Iprint "The standard deviation of group 2 on the predictor is: " sdx2 
print "What is the st. dev. of group 2 on the squared predictor?"; 
sd22 = con(l, 1) ; 
vz2 = sdz2 ^ 2; 

Iprint "The st. dev. of group 2 on the squared predictor is: " sdz2; 
print "What is the st. dev. of group 2 on the criterion?"; 
sdy2 = con (1,1); 
vy2 = sdy2 2 ; 

Iprint "The st. dev. of group 2 on the criterion is: " sdy2; 
print "What is the X-Y correlation for group 2?"; 
rxy2 = Con (1,1); 

Iprint "The X-Y correlation for group 2 is: " rxy2; 
print "What is the Xsquared-Y correlation for group 2?"; 
ry22 = con (1,1) ; 

Iprint "The Xsquared-Y correlation for group 2 is: " ryz2; 
print "What is the X-Xsquared correlation for group 2?"; 
rxz2 = con(l, 1) ; 

Iprint "The X-Xsquared correlation for group 2 is: " rxz2; 
print "What is the intercept for group 2?"; 
a2 = con(l, 1) ; 

Iprint "The intercept for group 2 is: " a2; 
print "What is the slope on X for group 2?"; 
bx2 = con(l, 1) ; 

Iprint "The slope on X for group 2 is: " bx2 ; 
print "What is the slope on Xsquared for group 2?"; 
bz2 ~ con(l, 1) ; 

Iprint "The slope on Xsquared for group 2 is: " bz2; 
print "What is the lower limit of the domain of X?"; 
Ix = con(l, 1) ; 

Iprint "The lower limit of the domain of X is: " Ix; 
print "What is the upper limit of the domain of X?"; 
ux = con(l, 1) ; 

Iprint "The upper limit of the domain of X is: " ux; 
dif f = (ux-lx) /lO; 
df = nl + n2 6; 
f = fcv(l,df , .05) ; 
cyyl = (nl-1) * vyl; 
cyy2 = (n2-l) * vy2; 
cxxl = (nl-1) * vxl; 
czzl = (nl-l) * vzl; 
CXX2 = (n2-l) * vx2; 
C2Z2 - {n2-l) * vz2; 
cxyl ^ rxyl * sqrt(cxxl * cyyl) 
cxy2 = rxy2 * sqrt(cxx2 * cyy2) 
cyzl = ryzl * sqrt(czzl * cyyl) 
cyz2 = ryz2 * sqrt(czz2 * cyy2) 
cxzl = rxzl * sqrt(cxxl * czzl) 
CXZ2 = rxz2 * sqrt(cxx2 * czz2) 

P = (cyyl-cxyl*bxl-cyzl*bzl) + (cyy2~cxy2*bx2-cyz2*bz2 ) ; 
t = f*p/ (nl+n2-6) ; 

a = czzl/ (cxxl*czzl-cxzl*2) + czz2/ (cxx2*czz2-cxz2"2) ; 
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• a =• (bxl-bx2) "2 - t * a; 

b cxzl/ (CXXl*CZ2l-CXZl*2) + cxz2/ (cxx2*czz2-cxz2^2) f 
b ^ (bxl-bx2)*(bzl-bz2) + t * b; 

C = CXXl/ (CXXl*CZZl-CXZl^2) + CXX2/ (cxx2*czz2-cxz2*2) ; 
c = (bzl-bz2)*2 - t * c; 

e«(inxl*czzl-mzl*cxzl) / (cxxl*czzl-cxzl'"2) + (mx2*czz2-xnz2*cx22) / (cxx2*czz2-cxz2*2) ; 
e = (al-a2) *(bxl-bx2) + t * e; 

g«(inzl*cxxl-inxl*cxzl) / (cxxl*c2zl-cxzl*2) + (mz2*cxx2-inx2*cxz2) / (cxx2*czz2-cxz2*2) ; 

g = (ai-a2)*(bzl-bz2) + t * g; 

hla * cxxl*cz2l/ (cxxl*czzl-cxzl'*2) ; 

h2a = cxx2*czz2/ (cxx2*czz2-cxz2*2) ; 

hlb = inxl*2/cxxl-2*inxl*mzl*cxzl/cxxl/czzl+xnzl'*2/czzl; 

h2b = Inx2*2/cxx2-2*mx2*mz2*cxz2/cxx2/czz2+inz2*2/czz2; 

h = (al-a2)"2 - t * (hla*hlb + h2a*h2b + (nl+n2) /nl/n2) ; 

fl = c; 

f2 = 2 * b; 

f3 = a + 2*g; 

f4 = 2 * e; 

f5 = h; 

X » polyroot (f 1 | f 2 | f 3 | f 4 | f 5) ; 

print "Below are the four solutions at which the difference between the"; 

Iprint "Below are the four solutions at which the difference between the"; 

print "means of the two groups is exactly significantly different from zero."; 

Iprint "means of the two groups is exactly significantly different from zero."; 

print ''In the first column are values of X, the predictor,"; 

Iprint "In the first column are values of X, the predictor,"; 

print "but only those solutions having zero in the second column are real."; 

Iprint "but only those solutions having zero in the second column are real."; 

print x; 

Iprint x; 

X - Ix-diff; 

print "The critical value of F is " f " at 1 & " df " degrees of freedom"; 
Iprint "The critical value of F is " f " at 1 & " df " degrees of freedom"; 
do while x < ux; 
X = x + diff; 

d « (al-a2) + (bxl-bx2)*x + (bzl-bz2) *x"2; 

ql = hla* ( (x-mxl) *2/cxxl+(x'*2-mzl) *2/czzl-"2*cx2l* (x-mxl) * (x*2-mzl) /cxxl/czzl) ; 
q2 = h2a* ( (x-mx2) '*2/cxx2+ (x*2-mz2) *2/czz2-2*cxz2^ (x-mx2) * (x*2-mz2) /cxx2/czz2) ; 
q = {nl+n2)/nl/n2 + ql -f q2; 
den ~p*q/ (nl+n2-6); 
t2 = d"2/den; 

print "At X = " X " the value of F is " t2; 
Iprint "At X = " X " the value of F is " t2; 
if t2 < f; 

print "The difference between the two groups is not significant."; 
Iprint "The difference between the two groups is not significant."; 
else; 

print "The difference between the two groups is significant."; 
Iprint "The difference between the two groups is significant."; 
endif ; 
endo; 
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The* size of group i is 383.00000 

The mean of jup i on the predictor is 2.4500000 

The mean of group 1 on the squared predictor is; 7.3400000 

The standard deviation of group 1 on the predictor is; 1.2083000 

The St. dev. of group l on the squared predictor is: 6.3812000 

The St. dev. of group 1 on the criterion is: 9.0277000 

The X-Y correlation for group 1 is; 0.053170747 

The Xsquared-Y correlation for group 1 is: 0.074295255 

The X-Xsquared correlation for group i is: 0.96621986 

The intercept for group 1 is: 55.900000 

The slope on X for group 1 is: 2.1500000 

The slope on Xsquared for group 1 is: -0.38000000 

The size of group 2 is 409.00000 

The mean of group 2 on the predictor is: 2.7100000 

The mean of group 2 on the squared predictor is: 9.2200000 

The standard deviation of group 2 on the predictor is: 1.3527700 

The St. dev. of group 2 on the squared predictor is: 7.5953900 

The St. dev. of group 2 on the criterion is: 9.9769700 

The X-y correlation for group 2 is: 0.15559475 

The Xsquared-Y correlation for group 2 is; 0.062946167 

The X-Xsquared correlation for group 2 is: 0.97227624 

The intercept for group 2 is: 49.300000 

The slope on X for group 2 is: 3.24 00000 

The slope on Xsquared for group 2 is: -0.38000000 

The lower limit of the domain of X is: 0,00000000 

The upper limit of the domain of X is: 5.0000000 

Below are the four solutions at which the difference between the 

means of the two groups is exactly significantly different from zero. 

In the first column are values of X, the predictor, 

but only those solutions having zero in the second column are real. 

3.4551557 1.8045932 
3.4551557 -1.8045932 
4.0886800 0.00000000 
-0.47620005 0.00000000 
The critical value of F is 3.8531229 at 1 & 786.00000 degrees of 

freedom 

At X = 0.00000000 the value of F is 6.4847231 

The difference betweer. the two groups is significant. 

At X = 0.50000000 the value of F is 12.214326 

The difference between the two groups is significant. 

At X = 1.0000000 the value of F is 23.092253 

The difference between the two groups is significant. 

At X = 1.5000000 the value of F is 31.537363 

The difference between the two groups is significant. 

At X = 2.0000000 the value of F is 25.703420 

The difference between the two groups is significant. 

At X = 2.5000000 the value of F is 17.870088 

The difference between the two groups is significant. 

At X = 3.0000000 the value of F is 13.120948 

The difference between the two groups is significant. 

At X = 3.5000000 the value of F is 9.3430146 

The difference between the two groups is significant. 

At X = 4.0000000 the value of F is 4.6310035 

The difference between the two groups is significant. 

At X = 4.5000000 the value of F is 1.3514992 

The difference between the two groups is not significant. 

At X = 5.0000000 the value of F is 0.28075711 

The difference between the two groups is not significant. 
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